|
|
Accession Number |
TCMCG075C12300 |
gbkey |
CDS |
Protein Id |
XP_017975083.1 |
Location |
complement(join(3094812..3094925,3095210..3095446,3096042..3096224,3096334..3096439,3096545..3096704,3097067..3097155,3097249..3097298,3097821..3097919,3098337..3098435,3099722..3099898,3100470..3100579,3100716..3100770,3101230..3101298,3101495..3101560,3101711..3102172)) |
Gene |
LOC18601132 |
GeneID |
18601132 |
Organism |
Theobroma cacao |
|
|
Length |
691aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018119594.1
|
Definition |
PREDICTED: histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH4 isoform X1 [Theobroma cacao] |
CDS: ATGGTGGTTCAGTCGCCGCATTCGGTGGAGAGTTCTGTACCAACGGCTAAAAACGGAGGCAGAAGAGGGAAAGGTTCGTCGGAGAAGGAGAACGTGTCTGGACAGCGAAGAGCGAGTGCGAGATTGCTGGCCGCCAAAGAAAGAGCTGAGAAGGCGCTTTCGATAAAGCGGGGAGTGGAGGTCCTGGACCCGGAGGAAGATGGAGGCAGAAGCAGAAAAAAGGCTAATGTGGAGAGTGAGAAGCTAAATCTAAAAGATGCTCAAAATGGGAGCCCAAAACTTCCTGAGACTGAACCCAAAATCAACGAGAAAGTGGCGAAAATGGTAGAAAGAGCGGCCAAGATTGCAGAGGGACTTGATACTACTAATGCTAATGCTCCAAATGTTGTTGAGAAGAGTGCACATATTAAAGTGAAAGAGACTATAAGGTTGTTCAACAAGCATTATCTTCACTTTGTTCAGGAAGAGGAAAAGAGGTGCGGAGCGGTCAAGGTTGGAAAGAAAGCCCCCAAGGGCAAGAAGACCAAGAAAAGAGATGTATCTGAAGGTGATGGCAAGGGCAAGGCCAAGCGACCAGACTTGAAGGCAATAACAAAGATGATGGAGAAAAATGAGGTGCTTTATCCTGAGAAAACTATAGGCAGCCTTCCAGGCATTGATGTTGGTCATCGGTTTTATTCTCGTGCTGAAATGGTTGCTGTTGGTTTTCATAGCCATTGGTTGAATGGTATTGATTATATGGGACAGTCCTACAAGAAAGGGGAGTATGAACACTATATATTCCCACTTGGAGTAGCTATAGTTTTATCGGGCATGTATGAGGATGATTTAGATAATGCTGAAGATGTTGTCTATACTGGACAAGGAGGGCATGACCTAACTGGTAATAAACGTCAAATTCGGGATCAAGTTATGGAACGTGGTAATCTCGCACTCAAGAACTGTGTGGACCAAGGCGTGCCTGTCAGAGTAGTTCGTGGTCATGAATCTGCTAGCAGTTACTCTGGAAAAATTTATACATATGATGGCTTATACAAGGTTGTTAAGTACTGGGCAGAGAAGGGTATTTCTGGGTTTACCGTTTTTAAATATAGATTGAGGCGGCTTGAAGGACAGCCAACATTAACAACTAGCCAGGTTCAATTTACCTATGGGCGTGTTCCCAAGTGTCCTTCAGAAATTCGCGGGTTGGTGTGTGAGGACTTAAGTGGTGGTCAAGAGGATGTTCCCATTCCAGCAACTAATCTGGTTGATGATCCACCCGTTGCACCGACAGGTTTTACATATTGCAAGTCTATGAAAGTTGCACGAAATATAAAGCTCCCTTCTAATGCTGCTGGATGTGATTGCAAGGGAGTTTGCTGGGATCCGAAGGCTTGTGCTTGTGCCAGGCTTAATGGTTCTGATTTTCCATATGTGCACCGCGATGGTGGCAGATTAATAGAAGCCAAGCATATTGTTTTTGAATGTGGTCCAAAATGTCGCTGTAATGCTAATTGCGTGAATCGTACATCTCAGAGAGGATTGAAATATCGACTGGAGGTCTTCCGTACTCCAAAGAAAGGATGGGCTGTTAGATCATGGGATTTTATACCTGCTGGTGCCCCAGTTTGTGAATACATTGGAGTACTCACGAGGACAGAAGAACTGGATAATGTGTCTGAGAATAATTACATTTTTGACATTGATTGCTTGCAAACTATGAGAGGGCTTGGTGGCAGAGAGAGGCGGCAACAAGATGCGTCTTTGCCAATGATCCAGAACATGGACAAAAATGATGAACAGAGGTCAGAGAGTGTGCCAGAGTTCTGCATTGATGCTGGTTCTTTTGGAAATGTTGCAAGATTTATCAATCATAGCTGTGAGCCTAACCTCTTTATCCAGTGTGTCCTGAGTGCGCATCAGGATTTTAAACTAGCTCGAGTGATGCTCTTTGCAGCAGACAACATTCCCCCTTTGCAGGAGCTTACTTATGACTATGGTTATGCCCTTGATAGCGTTTATGGTCCTGATGGGAAGGTAAAACGGATGACCTGCTACTGTGGAGCAGAAGATTGCAGAAAGCGATTATTCTAG |
Protein: MVVQSPHSVESSVPTAKNGGRRGKGSSEKENVSGQRRASARLLAAKERAEKALSIKRGVEVLDPEEDGGRSRKKANVESEKLNLKDAQNGSPKLPETEPKINEKVAKMVERAAKIAEGLDTTNANAPNVVEKSAHIKVKETIRLFNKHYLHFVQEEEKRCGAVKVGKKAPKGKKTKKRDVSEGDGKGKAKRPDLKAITKMMEKNEVLYPEKTIGSLPGIDVGHRFYSRAEMVAVGFHSHWLNGIDYMGQSYKKGEYEHYIFPLGVAIVLSGMYEDDLDNAEDVVYTGQGGHDLTGNKRQIRDQVMERGNLALKNCVDQGVPVRVVRGHESASSYSGKIYTYDGLYKVVKYWAEKGISGFTVFKYRLRRLEGQPTLTTSQVQFTYGRVPKCPSEIRGLVCEDLSGGQEDVPIPATNLVDDPPVAPTGFTYCKSMKVARNIKLPSNAAGCDCKGVCWDPKACACARLNGSDFPYVHRDGGRLIEAKHIVFECGPKCRCNANCVNRTSQRGLKYRLEVFRTPKKGWAVRSWDFIPAGAPVCEYIGVLTRTEELDNVSENNYIFDIDCLQTMRGLGGRERRQQDASLPMIQNMDKNDEQRSESVPEFCIDAGSFGNVARFINHSCEPNLFIQCVLSAHQDFKLARVMLFAADNIPPLQELTYDYGYALDSVYGPDGKVKRMTCYCGAEDCRKRLF |